18.785: Analytic Number Theory, MIT, spring 2007 (K.S. Kedlaya) 

Brun's combinatorial sieve 

In this unit, we describe a more intricate version of the sieve of Eratosthenes, introduced 
by Viggo Brun in order to study the Goldbach conjecture and the twin prime conjecture. 
It is most useful for providing lower bounds; for upper bounds, the Selberg sieve (to be 
introduced in the following unit) is much less painful. 

1 Sieve setup 

Let / : N — > C be an arithmetic function, and suppose we want to estimate the sum of / 
over primes. More precisely, let P be a set of primes, and put 

p{z)= n p- 

p<z,p€P 

If we define 

S{x,z)= f{n), 

n<x,{n,P{z})=l 
n<x,n=0 (mod d) 

(with the dependence on P and / suppressed from the notation), we have 

s{x,z) = E MM^)- 

d\P{z) 

As before, suppose there is a multiplicative function g such that for d squarefree with all 
prime factors in P, 

Ad{x) = g{d)X + rd{x), 

with X = X{x) independent of d, and the error term Vdix) small when d is small relative to 
X (in a sense to be made precise later). Suppose further that 

g{p)e [0,1) ipeP); g{p)^0 {p^P). (1) 

(If we need to take g{p) — 1, then we cannot expect to get much of a contribution from 
numbers not divisible by p; we should resign ourselves to this, and instead remove p from 
P.) Then we can rewrite 

S(x,z) = V(z)X + R(x,z) 
V{z) = n (1 - 9{P)) 

p\P(z) 

R{x,z) = E Kdydix). 

d\P{z) 
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If z is small relative to which in practice will mean 2; < for some cutoff a G (0, 1), we 
may be able to show that the main term V{z)X dominates the error term R{x^ z). Again, 
the main term is what you would predict from the heuristic that if an integer is chosen 
randomly, its divisibilities by different primes should act like independent random events. 

For instance, if P is the set of all primes and z > x^^"^, then S{x, z) = X^p<3, /(p)- If / is 
the function 




1 n — 2 prime 
otherwise. 



then by the error term in the prime number theorem for arithmetic progressions, 

rd(x) — 0{x\og~^ x) 

for any fixed A > 0. (It is now important that we have that bound uniformly in d\) Also, 
S{x, X"^/^) counts twin primes up to x, whereas S{x, a;^/(^+^)) counts primes p such that p + 2 
has no prime factor less than x^^^^~^^\ and hence has at most N prime factors. 



2 Brun's combinatorial sieve 

We would like somewhat finer control than was provided by the sieve of Eratosthenes; the 
trouble is that R{x, z) has too many terms for us to be able to control it. 

Brun's approach to get aronud this is to truncate the Mobius function by restricting it 
to suitable subsets and D~ , subject to the restriction that for n a product of primes in 
P, the incomplete convolutions 

d\n,deD+ d\n,deD- 

satisfy 

S-{n) < S{n) < S+{n) (2) 

for 

a\n V 

One such choice would be to take and D~ to consist of all squarefree numbers whose 
number of distinct prime factors is even or odd, respectively. This choice is much too crude; 
we should instead make a choice that allows some cancellation in 5~ and 5"*" without messing 
up the inequality (2). Moreover, we want to restrict D'^ and D~ to be subsets of {1, . . . , y} 
for some y which is not too large compared to x. 
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Let X'^{d) and A (d) denote the functions which agree with /i on and D , respectively, 
and are zero elsewhere. Put 

d\P{z) 
d\P{z) 

Then by virtue of (2), we have 

V-{z)X + R-(x, z) < S(x, z) < V+(z)X + R+(x, z). (3) 

It is not at all obvious how one can usefully arrange for D~^,D~ to satisfy (2); here is 
Brun's choice. For d a squarefree positive integer, write d = pi ■ ■ ■ Pr with pi > ■ • • > p^. Set 

^{d^Pi---Pr:pm<ym m odd} 
D" ^{d^Pi---Pr:pm<ym m cvcn}, 

where yi,y2, ■ ■ ■ are certain parameters which may depend on d. (By convention, 1 e -D^.) 
We then have the following. 

Lemma 1. With notation as above, let Vn{z) be the sum of g{pi ■ ■ ■Pn)y{Pn) over sequences 
Pi> ■ ■ ■ > Pn of primes such that: 

(a) pi< z; 

(b) Pn > yn, 

(c) Pm < ym for m < n with m = n (mod 2). 
Then 

V{z) = V^{z) - J2 K(^) 

n=l (2) 
n=0 (2) 



and so 

V-{z) < V{z) < V+{z). (4) 
Proof. Exercise. □ 
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In particular, for a given n, we deduce (2) from (4) by rigging up the set P so that 
P{z) — n and putting g{d) — 1 for all d. 

The functions and A~ given above arc together called the combinatorial sieve with 
parameters yi,y2, To use it, one must bound 

d<y,d\P{z) 

for y such that C {1, • • • ,y}; in this case R{x,y) > |i?^(a;, giving error bounds in 
(3). One must also bound V^{z). 



3 Setting some parameters 

To turn this into an actual numerical theorem, we must set the sieve parameters; we do this 
following Iwaniec-Kowalski. Remember that we may allow the yi to depend on d. 
Write d = pi ■ ■ ■ Pr with pi > • • • > p^; we now take 

ym = (y/(pi---Pm)y^^, 

where (3 > 1 will be specified later. This makes it clear that all elements of U belong 
to {1, ... ,y} except possibly for single primes in D". We can remedy this by requiring z < y; 
more precisely, we will take z — y^^^ for some s > (5. 

We will also need to make some restriction on the multiplicative function g. Namely, we 
assume that for some K > 1 and > 0, we have for all w, ^, 



w<p<z 



Wc refer to /t as a sieve dim,ension of the function g. This number is quite critical; it will 
determine how large we can make z compared to y, which determines how many small primes 
we can use for sieving. 



4 Bounding the main term 

We need an upper bound on V~^{z) and a lower bound on V~{z); we get both of these by 
getting an upper bound on Vn{z). First, let us simplify the sum by relaxing the summation 
conditions. We claim that for any tuple pi, . . . ,Pn appearing in the sum defining Vn{z), and 
any m < n, 

Pi---Pm-ipt<y- (6) 
Namely, if m = n (mod 2), we have the stronger inequality 

Pi---Pm-iPm^ <y- 
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If m > 1 and m ^ n (mod 2), we have 

Pi ■ ■ -Pm-ivL < Pi • • •Pm-2P^-i < y- 
Finally, \im — 1 and m^n (mod 2), we have 

< = yPh < y_ 

Prom (6), we deduce by induction on m that 

Pi---Pm< y^-^^-^"')" (m = 1, . . . , n - 1). 

In particular, 

Pn > {y/{Pl ■ ■ -Pn-.))"^'^'^ > y^^^'-'-'^"-' > > 

if we put 



We will now retain only the conditions z > pi >■•■> Pn > Zn on the primes, which will 
make the sum bigger because every summand is nonnegative. That is, 

Vn{z) < ^ g{Pl---Pn)V{pn) 

Z>pi>--->Pn>Zn 



\Zn<P<Z / 

< 1 v(.„) (log 



n\ ' \ ° V{z) 

Here is where we need the assumption (5) about the sieve dimension. It implies 

< K{1 + (/? - 1)-^)"" < Ke""/^ 

V{z) 

for P = nb + 1 (using the bound 1 + a; < for a; = (/5 — = l/(«;6)), which gives us 

K (n 



yn{z)<-{^^+\0gK)\-l'V{^ 



(using the bound W x < iox x = b{logK)/n). Since n\ > e{n/e)"' (by taking logs and 
comparing integrals), we obtain 

Vn{z) < e-^a"i^''+V(^) 
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for a = 6-1 6^+^ \ 

To conclude, we clean things up a bit. Remember that we were at liberty to choose 
/3 > 1, which is equivalent to choosing b > 0. By taking b sufficiently large, we can force 
a < 1; for instance, we could take 6 = 9 to get a < e~^. Note also that because 

Pi > • • • > Pn > Z/n = (y/(pi • • -Pn))^'^, 

we have pl'^^ > y. Since we also have pi < z = y^^^, we deduce that Vn{z) — unless 
n + P > s. Therefore 

5^Kw= y: k(.) < -f^i^''+v(.). 

n>0 n>s-l3 ^ ' 

To conclude, we have the following bound (Theorem 6.1 in Iwaniec-Kowalski). 

Theorem 2. In the combinatorial sieve with parameters yi,y2, ■ ■ ■ as above, and (3 = 9k + 1, 
for any multiplicative function g{d) satisfying (1) and (5) for a given K, and any s > (3, for 
z = y^/^ we have 

V+{z) < (1 + ef^-'K^'')V{z) 
V-{z) > {l + e^-'K^'')V{z). 

Consequently, 

(1 - e^-'K^'')V{z)X - R{x, z') < S{x, z) < {1 + e'^-'K''')V{z)X + R{x, z'). 



5 Consequences for twin almost-primes 

Again consider the example 

, J 1 n-2 prime 
f{n) = < 

I otherwise. 

By applying the combinatorial sieve, we may deduce the following (see exercises). 

Theorem 3. There are infinitely many primes p such that p + 2 is the product of at most 
twenty distinct primes. 

By refinements of the sieving method, Chen was able to prove the following. 

Theorem 4. There are infinitely many primes p such that p + 2 is the product of at most 

two distinct primes. 

This is tantalizingly close to the twin prime conjecture, but it seems that sieving methods 
fall short of delivering that particular prize. 

One can also use the combinatorial sieve to deduce that the number of twin primes < x 
is 0{x/ log^ x); however, since this is a question about an upper bound rather than a lower 
bound, we will be able to derive this much less painfully using the Selberg sieve. 
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Exercises 



1. Prove Lemma 1. (Hint: use the identity 

V{z)^l-Y,9{p)V{p) 

plus inclusion-exclusion.) 

2. Apply the combinatorial sieve to show that the number of integers less than or equal 
to X with no prime factors less than is at least cx/ log^ x for some c > 0. (You will 
need the prime number theorem in arithmetic progressions with error term, in order 
to control the error term R{x, z).) Then deduce Theorem 3. 
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